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ABSTRACT 

We present the discovery of a massive, quiescent galaxy at z = 2.99. We have obtained a 
HST /WFC3 spectrum of this object and measured its redshift from the detection of a deep 4000 
A break consistent with an old population and a high metallicity. By stellar population modeling 
of both its grism spectrum and broad-band photometry, we derive an age of ~ 0.7 Gyr, implying a 
formation redshift of z > 4, and a mass > 10 11 M Q . Although this passive galaxy is the most distant 
confirmed so far, we find that it is slightly less compact than other z > 2 early-types of similar mass, 
being overall more analogous to those z ~ 1.6 field early-type galaxies. The discovery of this object 
shows that early-type galaxies are detectable to at least z = 3 and suggests that the diversity of 
structural properties found in z = 1.4 — 2 ellipticals to earlier epochs could have its origin in a variety 
of formation histories among their progenitors. 

Subject headings: galaxies: evolution — galaxies: formation — galaxies: high-redshift 



1. INTRODUCTION 

In the nearby Universe, most stars reside in mas- 
sive, passively evolving spheroidal systems, either spiral 
bulges or elliptical galaxies (Baldry et al. 2004). Early- 
type galaxies (ETG), such as ellipticals and lenticulars, 
are often the most massive galaxies in their surround- 
ing environment and ETGs include the dominant stel- 
lar component in clusters of galaxies. As far as they 
can be found in significant numbers, massive ETGs ex- 
hibit very homogeneous spectrophotometric properties, 
requiring that their stellar content formed rapidly at rel- 
atively high redshifts and evolved passively for most of 
the cosmic time. In particular, studies of the core ETG 
population in galaxy clusters suggest that the quenching 
of their star formation (SF) was already underway by 
z~3. 

The precursors of z < 1 massive elliptical galaxies can 
arguably be found in some high-redshift submillimeter- 
selected galaxies, whose extreme star-formation rates 
(SFRs, in excess of 1000 M Q yr" 1 ; Daddi et al. 2009; 
Michalowski et al. 2010) imply that they should exhaust 
their gas reservoirs on short time scales (e.g., Carilli 
et al. 2010) and thus offer a striking example of rapid, 
unsustainable growth at early epochs. The spheroidal 
morphology of ETGs might then be either the conse- 
quence of merging events, triggering the starburst or 
happening after the quenching of SF, or dynamical 
instabilities internal to the galaxies (e.g., Martig et al. 
2009; Bournaud, Elmegreen & Martig 2009). This latter 
process may actually be the dominant one, given that 
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the vast majority (~ 
(Emsellem et al. 2011). 



of ETGs arc fast rotators 



One would therefore expect to find quenched, massive 
ETGs already at z ~ 2 — 3 and indeed passively evolving 
galaxies (PEGs; generally color selected as passive and 
not necessarily morphologically early-type) have been 
spectroscopically confirmed up to z ~ 2.5 (van Dokkum 
et al. 2008). Most of these early passive systems appear 
to be up to a factor ~ 10 denser than their modern 
counterparts (e.g., Daddi et al. 2005; Cimatti et al. 
2008; van Dokkum et al. 2008). However, the search 
for high-redshift ETGs is hampered not only by the 
expected rarity of such objects, but also by the intrinsic 
difficulty in reliably detecting them and measuring their 
redshifts. The most distinctive spectral feature of ETGs, 
the pronounced metal break at 4000 A is redshifted to 
the near-infrared at z > 1 and detection, let alone char- 
acterization, of z ~ 2 ETGs requires good continuum 
signal to noise (S/N) only achievable with extremely 
long exposures from the ground. Consequently, the 
sample of confirmed z > 2 ETGs is still small, and 
its high-redshift tail, i.e., the earliest epoch at which 
passive galaxies can be found, poorly constrained. 

Here we present the serendipitous discovery of the 
most distant spectroscopic ETG so far identified, i.e., 
a massive and passive galaxy at z ~ 3. Through- 
out the Letter, we assume a ACDM cosmology with 
H = 70 km s^ 1 Mpc" 1 , Q m = 0.27 and A = 0.73. Mag- 
nitudes are given in the AB photometric system. 

2. OBSERVATIONS 

This object (hereafter RS-235) was discovered in 
deep HST /WFC3 observations of the z ~ 2 cluster 
CI J1449+0856 (Gobat et al. 2011) at 14:49:16.5, 
+8:55:34.7, about 56" from the center of the structure. 
These data cover a 6.4 arcmin 2 field centered on CI 
J1449+0856 and consist of 2 orbits of imaging with 
the FU0W filter and 16 orbits of spectroscopy with 
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Figure 1. Left: 2" .6 X 2".6 HST/WFC3 FW0W image of RS-235. The image is 2".6x2".6 wide, with a pixel size of 0".06. Right: 
two-dimensional spectra of RS-235 in each of the three orientations, between 1 and 1.8 |im, smoothed with a 2 pixel bin. The bottom 
spectrum suffers from slight contamination between 1.25 and 1.5 fim from the tail of a bright, parallel second order trace and the two 
others have overlapping traces below 1.2 /im. Each pixel is 46.5 A (~ 11.7 A in rest-frame) wide in the dispersion direction and ~0".13 in 
the spatial (vertical) direction. The red lines bracketing the spectrum show the position of the 4000 A break and H,5, respectively. 



the G141 grism, in three different orientations. The 
individual frames were sky-subtracted and reduced 
using the best available calibration files. Standard 
procedures were applied: the WFC3 images were 
combined with MultiDrizzle and grism spectra for each 
orientation extracted with the aXe pipeline (Kummel et 
al. 2009). 6 The FUOW image of RS-235, as well as 
the two-dimensional spectra in the three orientations, 
is shown in Figure 1. Trace contamination was modeled 
using the F1AQW deep image (muo = 26.5 AB at 5er in 
a 1" aperture) and the multiwavelength dataset already 
available for this field. This extensive set of ground- 
based observations was first described in Gobat et al. 
(2011) and has recently been extended with deep U and 
V-band VLT/FORS2 imaging and deeper 3.5-4.5 /im 
Spitzer/IRAC imaging, in addition to the HST/WFC3 
data. At longer wavelengths, 24 /im Spitzer/MIPS 
data were already available and Herschel /PAGS 100 
and 160 /im observations have recently been obtained 
to expand the dataset (the maps reach a 5cr depth 
of 1.5 and 2.8 mJy respectively and will be described 
in a forthcoming paper). We have used this new 
multiwavelength ([/-band to IRAC) catalog to compute 
photometric redshifts for all objects in the HST /WFC3 
field with EAZY using the standard set of templates 
(Brammer et al. 2008, 2011). 

RS-235 stood out in this dataset as having a 2D spec- 
trum with a pronounced break at ~ 1.6 /im, a well- 
constrained photometric redshift of z ~ 3 and rest-frame 
UVJ colors consistent with an old stellar population 
(Wuyts et al. 2007). Furthermore, RS-235 is not seen 
at 24, 100 or 160 /im although, due to its redshift and 
the depth of the maps, this non-detection only provides 
very loose constraints on its SFR, with an upper limit 
of SFR< 400 Mq yr _1 assuming recent templates from 
Magdis et al. (2012). 

To ensure that the spectrum of RS-235 is free of con- 
tamination so that its continuum shape can be measured 
reliably, we performed, for each of the three WFC3 ori- 
entations, two different extractions from the cutout pro- 
duced by aXe. We first used a 3 pixel aperture, roughly 
corresponding to the FWHM of the object, and linearly 
fitted the observed spectrum with the combined contam- 

6 See e.g., http://www.stsci.edu/hst/wfc3/analysis/grism_obs/ 
cookbook.html 



ination and trace models. We compared the resulting 
one-dimensional spectrum with that obtained from fit- 
ting each cross-dispersion column of the cutout with pro- 
files derived, for each trace, from the profile of the corre- 
sponding object in the F140W image. The decontami- 
nated spectra were then stacked to produce a single high 
S/N (~ 11 pixel -1 at 1.6 /im) spectrum and converted to 
flux units. We note that, while the first orientation suf- 
fers from moderate contamination between 1.25 and 1.5 
/im (from the tail of a bright non overlapping trace) and 
the other two below 1.25 /im (from overlapping traces), 
the region redward of 1.5 /im where the break is seen is 
entirely free from contamination in all three cases. The 
sampling of the spectrum is 46.5 A with the G141 grism 
and its FWHM resolution 126 A, as grism spectra are 
convolved with the brightness profile of the dispersed 
sources. In this case, the profile is sufficiently circular 
(b/a ~ 0.8) that it was not necessary to optimize the 
orientation of the pseudo-slit during extraction. 

3. ANALYSIS 

As shown in Figure 2, the stacked spectrum shows 
two distinct spectral features: the break at 1.6 /im and 
an absorption feature (Us) at 1.636 /im. Their po- 
sition suggests a redshift of z ~ 3, which was mea- 
sured by comparison with templates (see below), yielding 
z = 2.993 ± 0.015. As its spectrophotometry precludes 
dusty SF solutions (see below), this makes RS-235 the 
most distant PEG confirmed so far. 

3.1. Spectrophotometry modeling 

To estimate the stellar mass and characterize the stel- 
lar population of the galaxy, we compared both the com- 
bined 15-band spectral energy distribution (SED) and 
high S/N part (1.4-1.65 /im) of the stacked grism spec- 
trum to a range of stellar population models. These were 
built using high resolution Bruzual & Chariot (2003) and 
Maraston (2005) (hereafter BC03 and M05) single stel- 
lar population templates, assuming Z = (1 — 2)Zq (see 
below), a Salpeter (1955) initial mass function (IMF). 
We considered two different sets of star formation his- 
tories (SFH): a delayed, exponentially declining SFH 
with characteristic time scale r between 1 Myr and 
2 Gyr and ages from 100 Myr to the age of the Uni- 
verse at z = 3, and a rising exponential SFH of the form 
SFR(t) = M xAx e At , starting at z = 10. The first one 
can be considered a versatile, albeit simplistic, "generic" 
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Figure 2. Left: HST /WFC3 G141 spectrum of RS-235 (black) and associated errors (gray), after subtraction of the contamination and 
stacking of the three orientations, compared with the best solution to the combined fit to the SED and spectrum (BC03 in red, M05 
in orange). Right: 15-band (U BV RI zY Jmno H K S +IIIAC) SED of RS-235 (blue) and associated errors (3cr upper limits are shown as 
arrows) with the same BC03 and M05 best solutions (red and orange, respectively) to the combined fit. 



SFH, while the second scenario is meant to put lower lim- 
its to the average SFR and formation time scales required 
to produce a galaxy such as RS-235. It is appropriate for 
a galaxy on the "main sequence" (MS) of star forming 
galaxies and relies on the apparent weak evolution of the 
specific SFR (sSFR) at z > 2.5 (e.g., Magdis et al. 2010; 
Gonzalez et al. 2010; Karim et al. 2011), which indeed 
implies an exponential growth. Its two parameters are 
the initial mass Mq of the galaxy at the beginning of MS 
evolution, and the parameter A setting the time scale 
and determined by the normalization of the SFR-M* re- 
lation at z ~ 2.5. A shallower slope of the SFR-Af* 
relation, e.g., a slope ~ 0.8 found by Rodighiero et al. 
(2011), would imply a power- law SFH with SFR(t) oc t A , 
whereas the exponential form has a simpler dependence 
on Mo. We allowed values up to 0.7 dex below and above 
the MS. 

In both cases we allowed for a SF cut-off at an arbi- 
trary time to be determined by the best fit procedure, 
with an option for a secondary burst after the initial 
quenching contributing up to 50% of the final mass. This 
addition is motivated by the detection of the galaxy in 
the rest-frame far-UV (the V band, as shown in Figure 
2) implying the presence of a small amount of massive 
stars. Note that the detection in B is marginal, at 
> 3ct, and might be due to contamination from a nearby 
source. Dust extinction was also added using a Calzetti 
et al. (2000) extinction law and recombination lines were 
computed using Gaussian profiles of matching resolution, 
assuming the SFR- f/a calibration of Kennicutt (1998) 
and standard line ratios (Anders & Fritze-v.Alvensleben 
2003). The model spectra were convolved with the 
filters responses to produce synthetic SEDs, smoothed 
to the resolution of the grism spectrum using a Gaussian 
kernel and rebinned to match the observed spectrum. 
The results of the fit are shown in Figure 2 and in Table 1 . 

Due to the low resolution of the grism spectrum, 
the spectral fit is mainly constrained by the shape 
of the continuum, namely the break at ~1.6 /im and 
the blueward slope of the spectrum, and the lack of 
prominent spectral features. Notably, the spectrum 



does not show any [Oil] 3727 emission. We find a mean 



stellar age of tsF — 0.7 



+0.15 
-0.1 



Gyr (corresponding to 



Zf > 4) , with a last episode of SF contributing less than 
10% of the final mass of M* = 1.2±g;| x 10 11 M©. For 
the delayed exponential SFH, the fit yields time scales 
of < 200 Myr. In the exponential SFH case, solutions 
with sSFR below the fiducial MS value (~ 3 Gyr" 1 
at 10 11 Mq, using Bethermin et al. (2012)) require 
apparently unrealistic seed masses of > 10 9 Mq at 
z = 10 (see e.g., M. Sargent et al., in preparation). 
In this case, the fit implies doubling time-scales of 
220 Myr or less. The amount of extinction compatible 
with the SED is small, E(B - V) < 0.1, and the fit 
accordingly rules out star-forming solutions, yielding a 
negligible residual SFR of - 0.5 M Q yr" 1 . RS-235 
thus appears to have sSFR of less than 5 x 10~ 3 Gyr" 1 , 
at least 2.7 dex below the MS of star forming galaxies 
at z ~ 3. Interestingly, models based on Maraston 
(2005) templates provide a less good fit to the rest- 
frame optical-NIR SED, underpredicting the flux in the 
observed K s band (rest-frame 5000-6000 A). As RS-235 
is still a young stellar population, this could be due to, 
e.g., the effect of thermally pulsing-asymptotic giant 
branch stars in the M05 models (which would produce 
a shallower Rayleigh- Jeans slope) or the presence of 
convective overshooting in the stellar tracks used by the 
BC03 models (Maraston et al. 2006). 

3.1.1. Metallicity 

Interestingly, solar metallicity models are rejected at 
> 3<7 by the stellar population modeling. Quantifying 
the strength of the 4000 A break using the D n 4000 index 
(Baloghet al. 1999), we find D n 4000 = 1.62±0.08 (where 
uncertainties were estimated through Monte Carlo simu- 
lations based on the error spectrum). As shown in Figure 
3, this value is comparable to that of a maximally old 
(zf = 10 SSP) Z = Zq stellar population, implying that 
this is a lower limit to the metallicity of RS-235 and sug- 
gesting that the strength of the break requires not only 
a passive stellar population but also a stellar metallic- 
ity above the solar value. We note that the significantly 
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Table 1 

Physical and Structural Parameters from Modeling 



z D n 4000 logM* t a t SF h r e n 
(M ) (Gyr) (Gyr) (") 

2.993 ±0.015 1.62±g;g| 11.08±g;i| 0.9±g | 0.71°;^ 0.16±°;{g 1.8 ±0.3 



a Look-back time to the beginning of star formation 

b SF-weighted age, i.e., look-back time at which the galaxy formed half of its stars 
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Figure 3. Dn4000 index as a function of age and redshift, for 
unreddened SSPs of solar and twice solar metallicity formed at 
z = 10. The dashed horizontal line and gray area show the value 
of Dn4000, and associated uncertainty, of RS-235 and the vertical 
line marks its redshift. 

worse resolution of the HST /WFC3 G141 spectra, com- 
pared to that of the slit spectra of lower redshift objects, 
on which the D n 4000 index is usually measured, can af- 
fect the apparent strength of the break. While this effect 
depends on the stellar population mix and is most pro- 
nounced in the case of SSPs, we find that at our effective 
resolution the correction is negligible for the best fitting 
template to the spectrum. 

3.2. Structural parameters 

RS-235 is a resolved object in the F140W image, with 
a FWHM size of ~ 2.5' resolution elements (~0".32). 
We measured its morphological properties by fitting the 
surface brightness profile with a Sersic model (Sersic 
1963) using GALFIT 3 (Peng et al. 2010). We con- 
structed a median point-spread function by combining 
six unsaturated, high S/N stars in our HST/WFC3 
field. RS-235 is relatively isolated, with the nearest 
sources being at least 2". 3 away and 1.5 mag fainter. 
The morphological analysis is therefore stable whether 
we include the closest neighbors or not. From a series of 
GALFIT simulations we find that the bias on structural 
parameters is negligible for a 24 mag object like this 
galaxy. The scatter on r e and n is about 10% and 
20% respectively, which we adopt as the standard error 
on the parameters. Accordingly, the Sersic fit yields 
a circularized effective radius of r e — 0.16±0".02, or 
1.3 kpc at z = 2.99, and a Sersic index of n — 1.8 ± 0.3. 
In Figure 4 we compare structural and stellar population 
parameters of RS-235 with samples of PEGs in the 
field at z ~ 1.6 (Onodera et al. 2012) and z <~ 2.3 (van 
Dokkum et al. 2008; Kriek et al. 2009). 



We note that we can not a priori discount the possibil- 
ity that RS-235 be lensed by the cluster CI J1449+0856, 
as it is relatively close to it, and its observable proper- 
ties (mass and size) therefore affected. To estimate this 
effect, we assumed a simple lens model with a single halo 
of mass 10 14 M Q (conservatively twice the X-ray mass, 
to account for the fact that the structure is likely not 
entirely virialized) at z ~ 2, centered on the X-ray emis- 
sion (sec Gobat et al. 2011). Considering both singular 
isothermal or Navarro, Frenk & White (1996) profiles 
with c > 1, we derive a magnification of less than 3%, 
almost a factor 10 lower than the uncertainties on cither 
mass or size and thus negligible. We similarly estimate 
that the closest projected neighbors of RS-235 do not 
generate any appreciable lensing effect. On the other 
hand, we find a magnification of up to 7% for a rela- 
tively bright spiral galaxy at 5". 2 and z p hot = 0.9, im- 
plying that the stellar mass could be overestimated by 
this much and the size by ~ 3%. Although these factors 
arc lower than the systematic error on either parame- 
ter, we added 7% and 3% to the lower uncertainties on 
the mass and circularized effective radius, respectively, 
to take the effect of magnification into account. 

4. DISCUSSION 

RS-235 appears to be a quenched, if not entirely 
passive galaxy at z = 2.99, the most distant spectro- 
scopically confirmed so far. Its spectrophotometric 
properties suggest that this galaxy formed the bulk of 
its stars relatively quickly at z > 4 and already has 
a high metallicity, comparable to that of local ETGs 
(e.g., Gallazzi et al. 2006). With n ~ 1.8, it has a 
flatter profile than the typical passive population at 
lower redshifts and appears to have not acquired yet 
a characteristic ETG morphology. As with the vast 
majority of z > 1.5 PEGs, it is also more compact (2-3 
times below the local mass-size relation for ETGs; Shen 
et al. 2003) than present-day ETGs, although apparently 
less so than other z > 2 passive galaxies of similar mass 
and rather well within the variation of z ~ 1.6 galaxies 
shown in Figure 4. We note that our size measurement 
was made on a rest-frame near-UV (350 nm) image, 
as were the measurements by Onodera et al. (2012), 
while van Dokkum et al. (2008) used rest-frame optical 
(480 nm) data. Based on the stellar population mod- 
eling, we estimate that the flux contribution of young 
(< 500 Myr) stars increases only by 10%-30% from 480 
to 350 nm, thus, this younger component, if present, 
should not affect much the effective radius measurement. 
As, additionally, the morphology of high-redshift ETGs 
has not been observed to vary significantly between 
the rest-frame near-UV and optical (e.g., Cassata 
et al. 2012), it is unlikely that the somewhat lower 
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Figure 4. Comparison of spectral and structural properties of RS-235 and other high redshift field PEGs (Onodera et al. 2012; Kriek et 
al. 2009; van Dokkum et al. 2008), excluding objects with disk-like (n < 1) morphology. Top, left: Dn4000 as a function of redshift. The 
black line shows the evolution of Dn4000 of RS-235, if the galaxy were to evolve passively to lower redshifts. Top, right: Dn4000 vs. the 
circularized effective radius as a fraction of the value expected from the local mass- morphology relation (Shen et al. 2003). Bottom, left: 
circularized effective radius as a function of the stellar mass (converted to a Chabrier (2003) IMF), with the local relation shown as a gray 
line. 



surface density of RS-235 compared to the van Dokkum 
et al. (2008) sample could be due to a positive age 
gradient. On the other hand, the population of PEGs 
at z = 1.4 — 2 has been found to be diverse in terms 
of structural properties with some early-types having a 
size comparable to that of local ETGs (Saracco et al. 
2011; Onodera et al. 2010; Mancini et al. 2010). The 
existence of a z = 3 passive galaxy, very close to its 
formation epoch, as extended as the z ~ 1.6 population 
reinforces this picture and suggests that the diversity of 
z < 2 ETGs could reflect not only a spread in assembly 
histories but also the existence of different formation 
pathways for these early ETGs. 

For example, the compactness of high-redshift ETGs 
is expected to depend on whether they were produced 
through gas-rich or gas-poor mergers (e.g., Springel 
& Hernquist 2005; Bournaud et al. 2011), or through 
secular gas exhaustion. However, the available data do 



not allow us to favor a particular formation scenario, 
although the relatively low Sersic index is not indicative 
of a recent (major) merger. On the other hand, the 
spectrophotomctric fit tends to favor solutions with 
short doubling time scales and accelerated evolution at 
high redshift, with respect to our fiducial MS. 

Finally, as shown in Figure 4, there are five non-disky 
PEGs in the Onodera et al. (2012) sample with D n 4000 
equal or higher than that of RS-235 at the same epoch, 
assuming passive evolution of the best fit models. These 
galaxies are likely to be at least as old as RS-235 and 
to have been already quenched at z ~ 3. Their surface 
density, 0.08±0.04 arcmin" 2 , is compatible with our sin- 
gle detection in a 6.4 arcmin 2 field, suggesting that they 
and RS-235 come from the same early PEG population. 
It also implies that confirmation of further z ~ 3 PEGs 
would require relatively few appropriately deep WFC3 
pointings and should thus be quite feasible, although the 
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response of the G141 grism will limit the identification 
of the 4000 A break to z < 3.1 until newer instruments, 
such as NIRSpec on the James Webb Space Telescope, 
become available. 

RG, VS, ED and MB were supported by grants ERC- 
StG UPGAL 240039 and ANR-08-JCJC-0008. 
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